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Abstract 

We consider invasion percolation on the square lattice. In [3] it has 
been proved that the probability that the radius of a so-called pond is 
larger than n, differs at most a factor of order log n from the probability 
that in critical Bernoulli percolation the radius of an open cluster is 
larger than n. We show that these two probabilities are, in fact, of the 
same order. Moreover, we prove an analogous result for the volume of 
a pond. 

1 Introduction 

Invasion percolation is a stochastic growth model of an interesting self- 
organised critical nature: it has characteristics that resemble critical Bernoulli 
percolation, even though the definition of the invasion process does not in- 
volve any parameter (see [HCE]). Comparison of the two processes helps to 
gain new insights into both of them (see e.g. [51 171 ITOl [T]). 

Recently a new comparison result, relating a so-called 'pond' in invasion 
percolation to a critical Bernoulli percolation cluster, has been proved in [3]. 
This result is sharpened and extended in the present paper. 

In the remainder of this Section we define the invasion percolation model 
and state our main results. The proofs, and important prerequisites, are 
given in Section [2j 

For general background on percolation, see [6]. 

Consider the hypercubic lattice Z d with its set of nearest neighbour bonds 
K d . If an edge e has endpoints v and w, we write e = (v, w). For an arbitrary 
subgraph G = (V,E) of (Z d ,E d ), we define the outer boundary AG as 

AG = {e = (v, w) £ E d : e £ E, but v £ V or w £ V}. 
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Invasion percolation is defined as follows. Let r(e),e G K d , be independent 
random variables, uniformly distributed on the interval [0,1]. Using these 
variables, we construct inductively an increasing sequence Go, G±, G2, ■ ■ ■ of 
connected subgraphs of the lattice. Go only contains the origin. If Gi = 
(Vi,Ei) has already been defined, we select the bond ei + \ which minimizes 
r on AGj, take = Ei U {ej + i}, and let Gj+i be the graph induced by 
the edge set Ei+±. The graph Gi is called the invaded cluster at time i, and 
Goo = (Voo, Eoo) = U^ Gj is the invaded region at time infinity. 

Invasion percolation can be coupled to Bernoulli bond percolation in the 
following standard way. Let < p < 1. For each bond e we say that e is p- 
open, if r(e) < p. One can then define, in an obvious way, p-open paths and 
p-open clusters, and the study of these objects corresponds with Bernoulli 
bond percolation with parameter p. 

From now on we will only consider the case when d = 2. It is well- 
known and easy to see that for every p £ [0, 1] the following holds: once the 
invasion reaches an infinite p-open cluster, it never leaves it again. Further, 
it is a classical result for 2D Bernoulli percolation that for every p > p c 
(which equals 1/2 on the square lattice) there is (a.s.) a p-open circuit 
that surrounds O and belongs to the infinite p-open cluster, and that (a.s.) 
there is no infinite p c -open cluster. These properties easily imply that (a.s.) 
f := maxeg^ r(e) exists and is larger than p c . Let e denote the edge where 
the maximum is taken and suppose that it is added to the invasion cluster 
at step i + 1. Following the terminology in [13], the graph G- = (V, E) is 
called a 'pond', or, more precisely, the first pond of O. Since the invasion 
can be started at any vertex v, not necessarily O, we have the more general 
notion 'first pond of v\ 

The above defined 'pond' is a very natural object (see [13] and [3]), and 
has several interpretations, for instance the following. In this (somewhat 
informal) interpretation each vertex (x,y) G 1? represents a 'polder': the 
square piece of (flat) land [x — 1/2, x + 1/2) x (y — 1/2, y + 1/2), surrounded 
by four dikes, corresponding with (the dual edges of) the four edges of (x, y). 
The heights of the dikes are the r values of the corresponding edges. Now 
suppose that water is supplied from some external source to the polder rep- 
resented by the vertex O. The water in this polder will rise until its level 
reaches the height of the lowest of its four dikes, say a. Then the water 
starts spilling over that dike, so that the level in the neighbouring polder (on 
the other side of the dike) starts to rise. If each of the other three dikes of 
that neighbouring polder is higher than a, the water in the polder of O will 
remain at level a until the above mentioned neighboring polder has reached 
this same water level, after which the level in both polders rises ('simultane- 
ously') until it reaches the height of the lowest of the six dikes bounding the 
union of these two polders, etc. On the other hand, if the neighboring polder 
of O has a dike with height b < a, the water level in this polder will rise up 
to level b and then starts spilling over that dike to a third polder (while the 
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level in the polder of O is still a) etc. In any case, (a.s.) eventually the water 
level at O will remain constant forever, namely at the level f defined above, 
and the 'connected' set of polders with the same final level as O is the above 
defined 'first pond'. Since water keeps being supplied, the 'surplus' water 
will spill over the lowest dike (corresponding with the earlier defined e), on 
the boundary of this pond: the outlet from this first pond to a second (lower 
level) pond. For clarity we note that for each vertex in the latter pond, this 
pond plays the role of 'first pond'. 

For further clarity we also note that the above 'hydrologic' interpretation 
has a more 'symmetric' version as follows: Now at each vertex there is an 
external water source (rain, e.g.). Again each polder has a final water level, 
and the maximal connected set of polders with the same final water level, 
containing a given vertex v, is the earlier defined (first) pond of v. Then, if 
V(v) denotes the vertices of the first pond of v, the collection {V(v)} v€ x 2 is 
a random partition of Z 2 which is stationary under translations. 

Before stating the results, we first fix some notation. Let R := max{|x| + 
\y\ : (x,y) £ V} be the radius of the first pond. Let P cr denote the product 
measure corresponding to critical Bernoulli bond percolation. Let B(n) de- 
note the box [— n,n] 2 and dB(n) := B(n) \ B(n — 1). 

Let A and B be sets of vertices. In the context of Bernoulli percolation, we 
denote the event that there is an open path from A to B by {A «-> B}. In 
the context of invasion percolation we denote the event that there is a p-open 
path from A to B by {^4 B}. To indicate that there is an infinite open 
(or p-open) path from A, we use the same notation with B replaced by oo. 
We use the notation g(n) « h(n), n — ► oo to indicate that 

log g{n) 

- — — - -> 1, as n -> oo, 
log n[n) 

and g(n) x h(n) to indicate that g(n)/h(n) is bounded away from an oo. 

Van den Berg, Peres, Sidoravicius and Vares have proved the following the- 
orem: 

Theorem. Proposition 1.3] 

P(R > n) ss P CT .(0 <-► dB(n)), n -> oo. (1.1) 

Using ideas and techniques from [7], we obtain the following improvement 
of the theorem above. 

Theorem 1. 

P(R > n) x P cr (0 dB(n)). (1.2) 

Moreover, we show that not only the radius but also the volume of the 
pond behaves like that of a critical percolation cluster: Let 

s(n) = n 2 P cr (0 dB(n)) C(0) = {v G I? : v}. 
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Theorem 2. There exist constants < c, c' < oo, such that 

cP cr (0 <- dB{n)) < P cr (\C(0)\ > s(n)) < P(\V\ > s(n)) 
< c'P cr (0 <-> ftB(n)). 

Corollary 3. 

P(\V\ > n) x P cr (|C(0)| > n). (1.4) 

Remark: These results, and the proofs in Section [2] also hold (with some 
obvious adaptations) for the triangular and the hexagonal lattice. 

2 Proofs of the main results 

In the following all the constants are strictly positive and finite without 
further mentioning. 

2.1 Preliminaries 

Let 

a(n,m,p) = /'(there is a p-open horizontal crossing of [0, n] x [0, m]), 

where it is assumed that the crossing does not use bonds lying on the top or 
the bottom sides of the rectangle. Given e > 0, we define, for p > p c , 

L(p, e) = min{n : a(n, n,p) > 1 — e}. 

It is shown in (9J (1.24)], that there exists an £o > such that for all 
e < so, the scaling of L(p,e) is independent of e in the sense that for all 
fixed < £i,£2 < £o the ratio L(p, E\)/L(p, £2) is bounded away from both 
and 00 as p [ p c . We let L(p) = L{p,eo) for the entire proof. Below we 
list some properties of L(p) that will play a crucial role in the proof of our 
results. The first two follow fairly easily from the definitions and standard 
arguments (see Section 2.2 in [7] for further explanation and references). The 
third is (a consequence of) a deep result in [9] 

1. L(p) is decreasing, right continuous and L(p) — » 00 as p | p c . 

2. There is a constant D such that 

3. Theorem [9], Theorem 2] There are constants Co > and C\ such 
that for all p > p c 

C P cr [0 ~ dB(L(p))] < 8{p) < CtPar [0 <- dB(L{p))] , (2.2) 

where 6{p) = P p (0 <-> 00) is the percolation function for Bernoulli 
percolation. 
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Finally we mention the following result on the behavior of -P cr (0 
dB(n)). It is believed (see Chapters 9 and 10 in [6] for background) that for 
2D percolation on sufficiently 'nice' 2D lattices this has a power law (with 
critical exponent 5/48) but so far this has only been proved for site perco- 
lation on the triangular lattice (see [E]). The following is sufficient for our 
purpose. 

There exists a constant D\ such that 

P cr (0 <-> dB(n)) frn 

P g (0.9%)) ^ln' l<m<n. (2.3) 

For m = 1 this was proved in [21 Corollary (3.15)]. For general m it can be 
proved in a similar way, using a block argument. 



2.2 Proof of Theorem Q] 

Proof. As it is pointed out in [3], it is very easy to see that 

P(R >n)> P cr (0 <-► dB(n)), (2.4) 

since the whole p c -open cluster of the origin is invaded before any edge with 
r value larger than p c is added to the invasion cluster. To prove that the l.h.s. 
of p.4p is smaller than some constant c times the r.h.s. is more involved. 
First note that it suffices to prove this for the case that n is of the form 2 k . 
Indeed, if it holds for those special cases then, for any 2 fc ~ 1 < n < 2 k we 
have 

P(R > n) < P(R > 2 k ~ l ) < cP cr [0 < — ► dB(2 k - 1 )] 

9 cP cr [0 < — ► dB{2 k )] < cP cr [0 < — ► dB(n)] . 

First some additional notation and definitions. As in [7] we define log^ ^ k = 
k and log^ k = log(log^~ 1 ^ k) for all j > 1, as long as the right-hand side 
is well defined. For k > 10 let 

log* k = min{j > : log^ k is well-defined and log^ k < 10}, (2.5) 

where the choice of the constant 10 is quite arbitrary. Clearly, log^ k > 2 
for j = 0, 1, ... , log* k and k > 10. Further, 

p k {j) ■= inf {p > p c : L(p) < 2 }, (2.6) 

where the constant C2 will be chosen later. It is easy to see that Pk(j) is 
well-defined for all sufficiently large k (in fact, for all k with 2 k > C2&), and 
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that the sequence {pfc(j)}j°J ^ ' m decreasing in j. The definition of Pk(j) 
together with the right continuity of L(p) and (12. ip readily implies that 

C 2 log (j) k < r , 2 * < DC 2 log^ k. (2.7) 
L (Pk(j)) 

Now we decompose P(R > n) according to the value of f as follows, where 
we note that since r has a continuous distribution, f does not coincide with 
Pk(j) for any j = 0, . . . , log* k, almost surely. 

P(R >n) = P(R > n,p k (0) < t)+P(R >n,f< p k (log* k)) 

log* k-1 / 2 g\ 

+ P(R>n,p k (j + l)<f<p k (j)). 

3=0 

To bound the terms in (|2.8|) we will use the following observations made in 
[3]. Let p be an arbitrary number between p c and 1. 

Observations 

(a) f < p if and only if the origin belongs to an infinite p-open cluster. 

(b) If r > p and R > n, then there is a p-closed circuit around O in the 
dual lattice with diameter at least n. 

The event in observation (b) will be denoted by A n ^ p . 

A n , P '■= {3 p-closed circuit around O in the dual with diameter at least n}. 

Starting with the first term of (|2.8p . Observation (b) gives 

P(R > n,p k (0) < f) < P(A nM0) ). (2.9) 

It is well-known (see [3] for more explanation and references) that there exist 
C3 and C4 such that for all p > p c , 

P(A n>p )<C 3 exv{-^) (2.10) 

Using the lower bound in (|2.7p and the definition of log^ ^ k we get that 

r Cati 1 1231 
P (^(0))<G 3 ex P {- z ^y} < C 3 n~ c ^ (2.11) 

As mentioned above, we have P cr (0 <-> dB(n)) > CrT 1 ! 2 . Hence, by taking 
C2 > I/C4, we can ensure that 

P(A n>Pki0 )) < C^ 1 < C 3 P cr (0 ~ dB{n)). 
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Remark: For future purpose we will even take C2 > 2/C4. 
For the second term of (|2.8p we apply observation (a) to get 

Obs. (a) 

P(R >n,f< Pk (log* k)) < P(f < Pk (log* k)) < e( Pk (log* A;))- 
Furthermore, using (12. 2|) . (|2.7p . the definition of (log* fc) and (|2.3p . we have 



#(p fc (log* fc)) < CiP cr [0 <- aB(L(p fc (log* fc) 
<C 5 Pcr[0^SB(n)], 



for some constant C5. 

Now let us consider a typical term in the summation in (12. 8h , The two 
observations a few lines below (|2.8p (and the definition of A n ^) give 

P(£>n,p fc (j + 1) <T<p k (j)) 

<P(0^oo,A n , Pfc0 . +1) ) (2-12) 

< %fc(j))P(^n, Pfc (j+l)), 

where in the last inequality we use the Harris-FKG inequality p3, Section 
2.2]. To bound the first factor in the right hand side of (|2.12|) . note that 



J2J3 

0( Pk (j)) < CiP^O <- L( Pk (j))) 

-rp (n++ xpr^ PcrjO " L(p k (J))) 

- dP cr (o ~ ap(2 )) Pcr(0 - dB{2k)) 

1Q /7, / o fc \ 1/2 ^ 2 - 13 ^ 

< ^P cr .(0^ dB{2 k ))< 



? ^P C r(o <- ap(2 fc ))(DC 2 io g ^) *o 1/2 - 

The second factor in the right hand side of (|2.12p can be bounded using 
(l2~TU1) . CT . (ESD and the choice of C 2 : 

P(^, Pfc0+ i)) < C 3 exp { - L{p ^ n +1)) } < C 3 (log^ k)~\ (2.14) 

Combining ff2TT3|) and (|2TT4l gives 

0( Pfc (j))P(^n, Pfc o- + i)) < C 8 (logW fc)^ 1 / 2 P cr [0 <- 0B(n)]. (2.15) 
To conclude the proof it suffices to show that 

log* k—1 

sup V (log (j) k)' 1 ' 2 < 00. (2.16) 
fc>io J=0 
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Recall from the definitions that log^ k > 2. Applying this to the case 
j = log* k shows that the last term in the sum in (12 . 16|) is at most (e 2 )" 1 / 2 . 
Similarly, the penultimate term is at most (exp(e 2 ))" 1 / 2 , etc. This leads to 
the finite upper bound Cg := -4= H — J= + ... for the l.h.s. of (12.161) . 

Putting everything together we get 

P(R >n)< (C 3 + C 5 + C 8 C 9 )P cr [0 <-> dB(n)} . 

□ 



2.3 Proof of Theorem H 

For short, we use the following notation: 

7r(n) = P cr (0 <-> dB(n)); 
vr(n,p) = P p (0 «-> 9J3(n)). 

Recall that s(n) = n 2 7r(n). 

The difficult part of Theorem [2] is the third inequality. We need the 
following key ingredient. 

Lemma 4. There exist constants C±q and C\\, such that 



P p 0«oo, |C(0) n B(2 k )\ > s(n) 



< 6{p) 2C W exp{ -(2C; 



'11; 



-1 



s(n) 



2 2fe vr(2 fc ,p) 

Proof. The proof is based on the following moment estimate: 



P >Pc,2 < n. 



E p [\C(0)nB(2 



k\\t 







oo < C w t\ 



C u 2 2k ir(2 k ,p) , t>l. (2.17) 



Very similar estimates were proved in [HI Theorem (8)] and in [TT]. To adapt 
their proofs in order to obtain (12. 17ft . one merely needs that the inequal- 
ity ^m=o 7r ( n '^') — Cnir(n,p) holds for all p > p c (with some constant C 
independent of p). From (|2.17|) . we readily get 



E p exp < A 



|C(0) DB(2 k ) 
2 2k n{2 k ,p) 



oo < C 



1 
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1 - ACn 



< A < C n l . 



Taking A = (2Cn) 1 we easily obtain the estimate of the lemma. 



□ 



Proof of Theorem^ The first inequality follows from [HJ Remark (9)]. The 
second inequality follows immediately from the fact that the p c -open cluster 
containing the origin is a subset of V. 

The third inequality will be proved by a decomposition, somewhat similar 
to the one in Theorem [TJ but now two-fold: this time we will also decompose 
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according to the value of R. As in the proof of Theorem HI without loss of 
generality we may assume that n is of the form 2^. 
Let 

E n ,k = {2 fc ~ 1 < R < 2 k , \V\ > s(n)}. 
Note that s(n) > C 12 n^ 2 , and \B(2 k )\ < C 13 2 2k . Letting 

k := max{k : C 13 2 2k < C 12 n 3/2 }, 

for k < k , R < 2 k implies \V\ < C 13 2 2k < s(n), and hence E n , k = 0. 
Therefore, we can write 

N 

P(\V\ > s(n)) <P(R>n)+Y, P(E n ,k). (2.18) 

k=ko 

The first term on the right hand side is at most Ci47r(n), by Theorem [H 
Consider now a general term of the sum. We decompose this according to 
the value of f as follows: 

P(2 k ^ <R<2 k , \V\ > s(n)) 

log* k ^ 2 19) 

= P{E n , k , f > p fc (0)) + p ( E n,k, Pk(j + 1) < f < Pk(j)), 

3=0 

where we let pk{log* k + 1) = p c . 

We first look at the event in the first term on the right hand side. This 
event implies the occurrence of A 2 fc-i pfc (o)- Hence, by virtue of (|2.11|) . its 
probability is at most Ci 5 (2 2k )~ CiC2 . By the choice of C 2 , we have C 4 C 2 > 1. 
Hence the sum over k < k < N is bounded by Ci 6 (2 2fco )- 1 . By the definition 
of ko, this is o(-7r(n)). 

Consider now the event in the general term on the right hand side of 
(|2.19l) . This event implies the following two events: 

(i) A 2 k-i >pk (j +1 y, 

(ii ) {0 m oo, \C(0;p k (j))nB(2 k )\>s(n)} ] 

where C(0;p) denotes the p-open cluster of 0. Since (i) is a decreasing and 
(ii) an increasing event, the Harris-FKG inequality yields that the general 
term in (|2.19p is at most the product of the probabilities of event (i) and 
event (ii). 

As to event (i), the same arguments that led to (|2.14p (and noting the Remark 
a few lines below (|2.1ip ) show that for j < log* k this has probability less 
than or equal to 

C 3 (logW ky 1 (2.20) 
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It is easy to see that, after increasing the value of C3 if necessary, this bound 
even holds for j = log* k. 

As to event (ii), by Lemma [U this has probability at most 

e( Pk U))(2C 10 ) exp |-(2C 11 )- 1 22k n ffi pk{j)) } ■ (2-21) 

Applying the first inequality in (|2.2|) to the probability in the exponent in 
(|2.2ip . and then applying (|2.13j) twice, shows that (|2.21|) is at most a constant 
times 

vr(2 fc )(logW k)V 2 exp |-C7 18 |^(logW fc)-Va j . (2.22) 
Combining this with ()2.20p gives that the general term in (|2.19p is at most 
C(n)(logO) k)~y^ exp {-C u g^(logCfl fc) - 1/a j (2 23) 

Due to (|2.3p . this as at most 
C 19 7r(n)(log^') fc)^ 1 /2 2 (^fc)/2 exp /-<7 20 2^- fe X 3 / 2 )(log«) £0~ 1/2 } . (2 .24) 

We split the sums over j and k into two parts: 

(1) 2^) < (log^ k) 1 ' 2 - 

(2) 2^- fe ) > (logW fc)V2. 

In case (1), we bound the exponential in (|2.24p by 1, and we have 

(log&'> / fc)- 1 /2 2 (^ fc )/2 < (log(j ) fe) -l/4 < C2l(log (i) iV)-V4. 

The number of possible values of k is at most 

(21og2)^ 1 log( J ' +1 ) k < C 2 2(log a) AO 1 / 8 . 
Hence the contribution of this case is bounded by 

log* N 

J2 (log (j) A0- 1/8 <C 23 . 

j=0 

In case (2), we bound the exponential by exp{— C2o^ N ~ k ^ 2 }, and we have 
(log^ / t)-i/2 2 (A^-fc)/2 < 2 ( N - k )/ 2 . The sum over k can be bounded as follows: 

* N - k)/2 exp{-C 20 2( Ar - fc )/ 2 } < C 24 ex P {-C 25 (logW A) c ^}, 

fc:7V-/c>clog( J ' +1) JV 

for some c\ > 0. The sum of the right hand side over j is again bounded. 
This proves the theorem. □ 
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